NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A 65-nm RRAM Compute-in-Memory Macro for Genome Processing

https://doi.org/10.1109/JSSC.2024.3396429

Zhang, Fan; Sridharan, Amitesh; He, Wangxin; Yeo, Injune; Liehr, Maximilian; Zhang, Wei; Cady, Nathaniel; Cao, Yu; Seo, Jae-Sun; Fan, Deliang (July 2024, IEEE Journal of Solid-State Circuits)

This work presents the first resistive random access memory (RRAM)-based compute-in-memory (CIM) macro design tailored for genome processing. We analyze and demonstrate two key types of genome processing applications using our developed CIM chip prototype: the state-of-the-art (SOTA) burrows–wheeler transform (BWT)-based DNA short- read alignment and alignment-free mRNA quantification. Our CIM macro is designed and optimized to support the major functions essential to these algorithms, e.g., parallel XNOR operations, count, addition, and parallel bit-wise and operations. The proposed CIM macro prototype is fabricated with monolithic integration of HfO2 RRAM and 65-nm CMOS, achieving 2.07 TOPS/W (tera-operations per second per watt) and 2.12 G suffixes/J (suffixes per joule) at 1.0 V, which is the most energy-efficient solution to date for genome processing.
more » « less
Full Text Available
APRIS: Approximate Processing ReRAM In-Sensor Architecture Enabling Artificial-Intelligence-Powered Edge

https://doi.org/10.1109/TETC.2024.3480700

Tabrizchi, Sepehr; Gaire, Rebati; Morsali, Mehrdad; Liehr, Maximilian; Cady, Nathaniel; Angizi, Shaahin; Roohi, Arman (January 2024, IEEE Transactions on Emerging Topics in Computing)

Full Text Available
Deep Mapper: A Multi-Channel Single-Cycle Near-Sensor DNN Accelerator

https://doi.org/10.1109/ICRC60800.2023.10386958

Morsali, Mehrdad; Tabrizchi, Sepehr; Liehr, Maximilian; Cady, Nathaniel; Imani, Mohsen; Roohi, Arman; Angizi, Shaahin (December 2023, IEEE)
A 65nm RRAM Compute-in-Memory Macro for Genome Sequencing Alignment

https://doi.org/10.1109/ESSCIRC59616.2023.10268783

Zhang, Fan; He, Wangxin; Yeo, Injune; Liehr, Maximilian; Cady, Nathaniel; Cao, Yu; Seo, Jae-Sun; Fan, Deliang (September 2023, IEEE European Solid State Circuits Conference (ESSCIRC))
Implementation of high-performance and high-yield nanoscale hafnium zirconium oxide based ferroelectric tunnel junction devices on 300 mm wafer platform

https://doi.org/10.1116/6.0002097

Liehr, Maximilian; Hazra, Jubin; Beckmann, Karsten; Mukundan, Vineetha; Alexandrou, Ioannis; Yeow, Timothy; Race, Joseph; Tapily, Kandabara; Consiglio, Steven; Kurinec, Santosh K; et al (January 2023, Journal of Vacuum Science & Technology B)

In this work, hafnium zirconium oxide (HZO)-based 100 × 100 nm2 ferroelectric tunnel junction (FTJ) devices were implemented on a 300 mm wafer platform, using a baseline 65 nm CMOS process technology. FTJs consisting of TiN/HZO/TiN were integrated in between metal 1 (M1) and via 1 (V1) layers. Cross-sectional transmission electron microscopy and energy dispersive x-ray spectroscopy analysis confirmed the targeted thickness and composition of the FTJ film stack, while grazing incidence, in-plane x-ray diffraction analysis demonstrated the presence of orthorhombic phase Pca21 responsible for ferroelectric polarization observed in HZO films. Current measurement, as a function of voltage for both up- and down-polarization states, yielded a tunneling electroresistance (TER) ratio of 2.28. The device TER ratio and endurance behavior were further optimized by insertion of thin Al2O3 tunnel barrier layer between the bottom electrode (TiN) and ferroelectric switching layer (HZO) by tuning the band offset between HZO and TiN, facilitating on-state tunneling conduction and creating an additional barrier layer in off-state current conduction path. Investigation of current transport mechanism showed that the current in these FTJ devices is dominated by direct tunneling at low electric field (E < 0.4 MV/cm) and by Fowler–Nordheim (F–N) tunneling at high electric field (E > 0.4 MV/cm). The modified FTJ device stack (TiN/Al2O3/HZO/TiN) demonstrated an enhanced TER ratio of ∼5 (2.2× improvement) and endurance up to 106 switching cycles. Write voltage and pulse width dependent trade-off characteristics between TER ratio and maximum endurance cycles (Nc) were established that enabled optimal balance of FTJ switching metrics. The FTJ memory cells also showed multi-level-cell characteristics, i.e., 2 bits/cell storage capability. Based on full 300 mm wafer statistics, a switching yield of >80% was achieved for fabricated FTJ devices demonstrating robustness of fabrication and programming approach used for FTJ performance optimization. The realization of CMOS-compatible nanoscale FTJ devices on 300 mm wafer platform demonstrates the promising potential of high-volume large-scale industrial implementation of FTJ devices for various nonvolatile memory applications.
more » « less
Full Text Available
Hybrid RRAM/SRAM In-Memory Computing for Robust DNN Acceleration

https://doi.org/10.1109/TCAD.2022.3197516

Krishnan, Gokul; Wang, Zhenyu; Yeo, Injune; Yang, Li; Meng, Jian; Liehr, Maximilian; Joshi, Rajiv V.; Cady, Nathaniel C.; Fan, Deliang; Seo, Jae-sun; et al (August 2022, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems)

RRAM-based in-memory computing (IMC) effectively accelerates deep neural networks (DNNs) and other machine learning algorithms. On the other hand, in the presence of RRAM device variations and lower precision, the mapping of DNNs to RRAM-based IMC suffers from severe accuracy loss. In this work, we propose a novel hybrid IMC architecture that integrates an RRAM-based IMC macro with a digital SRAM macro using a programmable shifter to compensate for the RRAM variations and recover the accuracy. The digital SRAM macro consists of a small SRAM memory array and an array of multiply-and-accumulate (MAC) units. The non-ideal output from the RRAM macro, due to device and circuit non-idealities, is compensated by adding the precise output from the SRAM macro. In addition, the programmable shifter allows for different scales of compensation by shifting the SRAM macro output relative to the RRAM macro output. On the algorithm side, we develop a framework for the training of DNNs to support the hybrid IMC architecture through ensemble learning. The proposed framework performs quantization (weights and activations), pruning, RRAM IMC-aware training, and employs ensemble learning through different compensation scales by utilizing the programmable shifter. Finally, we design a silicon prototype of the proposed hybrid IMC architecture in the 65nm SUNY process to demonstrate its efficacy. Experimental evaluation of the hybrid IMC architecture shows that the SRAM compensation allows for a realistic IMC architecture with multi-level RRAM cells (MLC) even though they suffer from high variations. The hybrid IMC architecture achieves up to 21.9%, 12.65%, and 6.52% improvement in post-mapping accuracy over state-of-the-art techniques, at minimal overhead, for ResNet-20 on CIFAR-10, VGG-16 on CIFAR-10, and ResNet-18 on ImageNet, respectively.
more » « less
Full Text Available
In-memory Computation of Error-Correcting Codes Using a Reconfigurable HfOx ReRAM 1T1R Array

https://doi.org/10.1109/MWSCAS47672.2021.9531717

Abedin, Minhaz; Liehr, Maximilian; Beckmann, Karsten; Hazra, Jubin; Rafiq, Sarah; Cady, Nathaniel C. (August 2021, 2021 IEEE International Midwest Symposium on Circuits and Systems (MWSCAS))
Investigation of ReRAM Variability on Flow-Based Edge Detection Computing Using HfO ₂ -Based ReRAM Arrays

https://doi.org/10.1109/TCSI.2021.3072210

Rafiq, Sarah; Hazra, Jubin; Liehr, Maximilian; Beckmann, Karsten; Abedin, Minhaz; Pannu, Jodh S.; Jha, Sumit K.; Cady, Nathaniel C. (July 2021, IEEE Transactions on Circuits and Systems I: Regular Papers)

Full Text Available
MR-PIPA: An Integrated Multilevel RRAM (HfO _x )-Based Processing-In-Pixel Accelerator

https://doi.org/10.1109/JXCDC.2022.3210509

Abedin, Minhaz; Roohi, Arman; Liehr, Maximilian; Cady, Nathaniel; Angizi, Shaahin (December 2022, IEEE Journal on Exploratory Solid-State Computational Devices and Circuits)

Search for: All records